Evaluations of HPF for Practical Scientific Algorithms on T3E

نویسنده

  • Chris H. Q. Ding
چکیده

HPF coding examples of practical scientific algorithms are examined in detail, with the idea that on these simple but non-trivial examples, we can fairly well understand issues related to different data distributions, different parallel constructs, and different programming styles (static vs dynamic allocations). Coding examples include 2D stencils solution of PDEs, N-body problem, LU factorization, several vector/matrix library routines, 2D and 3D array redistribution.The performances of HPF codes are close to hand-written message passing MPI codes, for LU factorization, vector/matrix routines, array redistributions. But for less regular data/communication patterns, the stencils calculation and the N-body problem, HPF codes perform considerably less efficient, about 2-4 times slower. Scaling of HPF codes is not as good as MPI codes. Some of the HPF codes performances are highly inconsistent, i.e., minor change of code could results in factor of 10 change in performance. Many peculiarities of HPF coding will be discussed.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

About the Performance of HPF: Improving Runtime on the Cray T3E with Hardware Specific Properties

High Performance Fortran permits to write parallel programs with much less programming effort than by using standard communication libraries such as MPI or PVM. The performance of compiled HPF programs is considered low, though. We show that a compiled HPF application will gain a substantial runtime improvement if compilation incorporates properties of the hardware architecture into the final p...

متن کامل

High Performance Fortran for practical scientific algorithms: An up-to-date evaluation

A suite of HPF coding examples of practical scientific algorithms are examined in detail, with the idea that on these simple but non-trivial examples, we can fairly well understand issues related to different data distributions, different parallel constructs, and different programming styles (static vs dynamic allocations). Coding examples include 2D stencils solution of PDEs, N-body problem, L...

متن کامل

Evaluating PGHPF on the Cray T3D/T3E EPCC-TR98-02

At present, EPCC has access to the Portland Group’s HPF compiler, PGHPF, on the Cray T3D and T3E and on our workstation cluster. We evaluate certain aspects of the compiler which are specific to user’s programs, as opposed to standard benchmarking routines. This work was done in support of the MHD Consortium (led by Dr. Alan Hood) and was funded by the UK’s High Performance Computing Initiative...

متن کامل

A Data-Parallel Formulation for Divide and Conquer Algorithms

This paper presents a general data-parallel formulation for a class of problems based on the divide and conquer strategy. A combination of three techniques—mapping vectors, index-digit permutations and space-filling curves—are used to reorganize the algorithmic dataflow, providing great flexibility to efficiently exploit data locality and to reduce and optimize communications. In addition, thes...

متن کامل

A scalable HPF implementation of a finite-volume computational electromagnetics application on a CRAY T3E parallel system

The time-dependent Maxwell equations are one of the most important approaches to describing dynamic or wide-band frequency electromagnetic phenomena. A sequential finite-volume, characteristic-based procedure for solving the time-dependent, three-dimensional Maxwell equations has been successfully implemented in Fortran before. Due to its need for a large memory space and high demand on CPU tim...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998